50 research outputs found

    Retrieval of bilingual Spanish-English information by means of a standard automatic translation system

    This paper describes our participation in bilingual retrieval (Spanish queries over English documents) using an information retrieval system based on the vector model. The queries, formulated in Spanish, were translated into English by a commercial automatic translation system; the terms extracted from the resulting translations were filtered to remove stop words and then normalised by stemming. Results are slightly more than 15% poorer than those obtained through monolingual retrieval with the original queries in English.
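
    As a rough illustration of the pipeline this abstract outlines (stop-word filtering, stemming, then vector-model ranking), a minimal sketch follows. The stop-word list, the `toy_stem` suffix stripper, the TF-IDF weighting and all function names are assumptions made for illustration; the abstract does not say which stemmer or weighting scheme was used, and the machine translation step is not shown.

```python
import math
from collections import Counter

# Tiny illustrative stop-word list (an assumption, not the list used in the paper).
STOP_WORDS = {"the", "of", "in", "a", "and", "to", "for", "by", "on"}


def toy_stem(word):
    """Crude suffix stripper standing in for a real stemmer (hypothetical)."""
    for suffix in ("ing", "ed", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def terms(text):
    """Lower-case, drop stop words, and stem the remaining alphabetic tokens."""
    tokens = [t for t in text.lower().split() if t.isalpha()]
    return [toy_stem(t) for t in tokens if t not in STOP_WORDS]


def tfidf_vectors(docs):
    """Build sparse TF-IDF vectors (one dict per document) and the IDF table."""
    term_lists = [terms(d) for d in docs]
    n = len(docs)
    df = Counter(t for tl in term_lists for t in set(tl))
    idf = {t: math.log(n / df[t]) + 1.0 for t in df}
    vectors = [{t: c * idf[t] for t, c in Counter(tl).items()} for tl in term_lists]
    return vectors, idf


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank(query, docs):
    """Return document indices ordered by decreasing similarity to the query."""
    vectors, idf = tfidf_vectors(docs)
    # Query terms unseen in the collection get weight 0.
    q = {t: c * idf.get(t, 0.0) for t, c in Counter(terms(query)).items()}
    scores = [(cosine(q, v), i) for i, v in enumerate(vectors)]
    return [i for _, i in sorted(scores, key=lambda p: (-p[0], p[1]))]
```

    In this sketch the query passed to `rank` would be the English text produced by the translation system, so translation errors propagate directly into the term vector.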

    Web Page Retrieval by Combining Evidence

    The participation of the REINA Research Group in WebCLEF 2005 focused on the monolingual mixed task. Queries, or topics, are of two types: named pages and home pages. For both, we first perform a search by thematic contents; for the same query, we also search several information elements of every page (title, some meta tags, anchor text) and then combine the results. For queries about home pages, we try to detect them using a method based on certain keywords and their patterns of use. Afterwards, the results of the thematic contents retrieval are re-ranked based on PageRank and centrality coefficients.
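
    One simple way to picture the combination and re-ranking steps is a weighted sum of per-element scores followed by a blend with a link-based score. The sketch below assumes exactly that; the abstract does not specify the fusion formula, so the weights, the `alpha` parameter and the function names are hypothetical.

```python
def combine_evidence(field_scores, weights):
    """Weighted sum of per-field retrieval scores.

    field_scores maps a field name (e.g. "content", "title", "anchor")
    to a dict {page: score}; weights maps field names to their weight.
    Fields without a weight contribute nothing.
    """
    combined = {}
    for field, scores in field_scores.items():
        w = weights.get(field, 0.0)
        for page, score in scores.items():
            combined[page] = combined.get(page, 0.0) + w * score
    return combined


def rerank(combined, link_score, alpha=0.8):
    """Blend content scores with a link-based score and sort pages.

    link_score is assumed to be a precomputed, normalised value per page
    (for example PageRank or a centrality coefficient); alpha controls
    how much the content evidence counts.
    """
    final = {
        page: alpha * score + (1.0 - alpha) * link_score.get(page, 0.0)
        for page, score in combined.items()
    }
    return sorted(final, key=lambda page: (-final[page], page))
```

    With `alpha=1.0` the link evidence is ignored and the order is that of the combined content scores alone.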

    La cibermetría en la recuperación de información en el Web

    The exponential growth of the Web and the characteristics of its distributed data (high volatility, lack of structure, redundancy and high heterogeneity) have introduced new problems in information retrieval processes. It is therefore necessary to open new avenues of research that allow good levels of accuracy to be obtained. Work based on exploiting the hypertext features of the Web is attracting considerable attention. Cybermetrics provides many options for working with links, and many of its techniques may be useful in information retrieval on the Web.

    Science and Technology in Social Networks: Twitter

    The use of the Internet as a primary source for scientific information searches is enhanced by social networks. This requires the content obtained in this way to be studied and standardized. Studying the scientific information on Twitter profiles makes it possible to identify the main topics, as well as the quantity and quality of the scientific information shared.

    Líneas de investigación en web mining, extracción automática de conocimiento y redes sociales

    REINA's research lines in Social Network Analysis. Application to Topic Detection & Tracking

    Scientific culture: Public perception of Science and Technology in Spanish Wikipedia

    The paper shows the relationships among Science and Technology articles in Wikipedia (Spanish edition), using social network analysis techniques.

    REINA at RepLab2013 Topic Detection Task: Community Detection

    Social networks have become a large repository of comments from which a wide range of information can be extracted. Twitter is one of the largest and most widespread social networks and is therefore an important source for detecting states of opinion, events and happenings, even before the mainstream media. Topic detection is important for discovering areas of interest that arise in tweets. We used classical systems to build a similarity matrix and then applied community detection techniques. The results have been good and allow us to study new possibilities.

    Experiencias en la utilización de metodologías no presenciales de aprendizaje en la impartición de la asignatura Informática aplicada a la traducción

    The text describes an experience of using e-learning to teach an official university course. The degree of utilization of e-learning tools is reported, and the effort required of students and teachers to acquire the skills to use them is analyzed. This allows us to evaluate whether students' learning activity with such systems requires major adjustment efforts. It is an important evaluation element that can be applied to learning in Translation and Interpretation studies adapted to the European Higher Education Area (EEES in Spanish).

    Cibermetría del Web: las leyes de exponenciación

    An introduction is given to the power laws enunciated by Michalis Faloutsos, which allow the Web to be characterized through the analysis of its topology. Their most important characteristics are described, along with how to calculate some of the values of the most interesting functions.
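
    To make the idea of calculating such values concrete, the sketch below estimates the exponent of a power law y = c · x^k by ordinary least squares on log-transformed data (for instance, x could be a node degree and y the number of nodes with that degree). This is a common textbook approach and an assumption here; the abstract does not state which estimation method the paper uses, and `fit_power_law` is a hypothetical name.

```python
import math


def fit_power_law(xs, ys):
    """Fit y = c * x**k by least squares on log y = log c + k * log x.

    xs and ys must be positive and xs must contain at least two
    distinct values. Returns the pair (k, c).
    """
    log_x = [math.log(x) for x in xs]
    log_y = [math.log(y) for y in ys]
    n = len(log_x)
    mean_x = sum(log_x) / n
    mean_y = sum(log_y) / n
    sxx = sum((a - mean_x) ** 2 for a in log_x)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(log_x, log_y))
    k = sxy / sxx                      # slope of the log-log line = exponent
    c = math.exp(mean_y - k * mean_x)  # intercept gives the constant factor
    return k, c
```

    On real Web graphs the data are noisy and heavy-tailed, so a log-log regression like this should be read as a first approximation rather than a rigorous estimate.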

    Análisis de temas emergentes a través de Twitter

    The analysis of emerging topics in social networks is used to learn the views expressed by individual users, to monitor the activities and events of associations, to analyze political campaigns, or to study the impact of companies' advertising campaigns. To detect these topics, the Latent Dirichlet Allocation algorithm was applied to a set of profiles in the field of information and documentation, in order to identify the topics covered in these groups and to assess whether the detection system is reliable. The approach works correctly and provides reliable results.